Text Mining for Drug Development: Gathering Insights to Support Decision Making

Author

  • Sherri Matis-Mitchell
Abstract

Drug discovery in Pharma R&D is an information-driven process requiring many disparate pieces of data from many different sources, both structured and unstructured. Text mining is the key methodology used to extract entities and relationships from unstructured text in the quest for the knowledge needed to bring a safe and effective drug to market and beyond. Much of the insight needed in early drug research to identify drug target-to-disease relationships and to progress a potential drug target comes from published literature and internal reports. Later-stage drug development requires many additional sources of information, including case reports, clinical trials, competitive intelligence and other diverse sources. In this publication, I will present four use cases showing how text mining is used to drive decision making in drug discovery and development, and how it can be used to identify patient insights from sources such as social media.

Keywords: Text Mining; Drug Discovery; Pharma R&D; Social media; drug safety; patient journey

I. INTRO TO DRUG DISCOVERY

Drug discovery began with the use of medicinal plants to treat illness. Later, it moved to extracting the pharmacologically active compound from natural products. Accompanying the genomic revolution, modern drug discovery methods shifted to the identification of disease-associated genes as potential drug targets, followed by the discovery of a compound or biologic that would interact favorably with the target to treat disease. Because of the variability of the human population, a drug can vary in efficacy and safety, so extensive preclinical and clinical testing is required by the regulatory agencies before a drug is launched onto the market. The modern drug discovery process takes place over an average of 10.7 years and at a cost of about 2.6 billion dollars [1,2]. To ensure drug safety, constant post-launch monitoring of a drug or biologic is required.

Finally, drugs can and do fail at any stage along the process, costing millions or billions in lost revenue and, in some cases, causing harm or even death. Drug discovery requires a lot of information to succeed, and asking the right question can go very far in ensuring a quicker and surer outcome. Understanding disease more quickly, finding better disease targets, finding the right compound, and identifying potential risk earlier in the process all help to make the process more efficient and shorten the timeline to a safe medicine. We need to select the right dose to maximize efficacy and minimize risk, and to develop meaningful trials in the right patient populations to ensure success. Even after the drug is launched, companies need to monitor for reports of adverse events that arise following drug treatment, but they also need to understand what patients are saying in order to minimize risk, understand competing therapies, and alleviate any new issues that arise. Because much of this information is present in written reports, published literature or study reports, text mining can help wade through the pages and uncover facts and relationships in unstructured text [3].

II. USE CASES FOR TEXT MINING

A. Text Mining in Early Drug Discovery

Many diseases, like diabetes or cancer, arise from a complex series of events involving multiple genes and pathways, but others, including many rare diseases, are associated with a single gene or even a single mutation in that gene. With the help of semantically enriched taxonomies of genes, diseases, drugs and biologics, text mining can identify both old and new relationships between genes and diseases, genes and drugs, and drugs and diseases [4]. New drug-to-disease relationships can represent potential repositioning opportunities.
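The taxonomy-driven relationship mining described above can be illustrated with a minimal sketch. It assumes a simple sentence-level co-occurrence approach, which is only one of several possible techniques and is not necessarily the method used in the work cited. The two mini-taxonomies, the example sentences, and the function names are all hypothetical and exist only for illustration.

```python
import re
from collections import Counter
from itertools import product

# Hypothetical mini-taxonomies: canonical name -> lowercase synonyms.
# A production taxonomy would hold thousands of curated entries.
GENES = {
    "CFTR": ["cftr"],
    "HTT": ["htt", "huntingtin"],
}
DISEASES = {
    "cystic fibrosis": ["cystic fibrosis"],
    "Huntington disease": ["huntington disease", "huntington's disease"],
}

# Illustrative sentences written for this sketch, not taken from any corpus.
SENTENCES = [
    "Mutations in CFTR cause cystic fibrosis.",
    "Expanded CAG repeats in huntingtin are associated with Huntington's disease.",
    "CFTR modulators are a treatment option for cystic fibrosis patients.",
    "HTT expression was measured in all samples.",
]


def find_entities(sentence, taxonomy):
    """Return canonical names whose synonyms occur as whole words in the sentence."""
    text = sentence.lower()
    found = set()
    for canonical, synonyms in taxonomy.items():
        for synonym in synonyms:
            if re.search(r"\b" + re.escape(synonym) + r"\b", text):
                found.add(canonical)
                break
    return found


def cooccurrences(sentences):
    """Count how often each (gene, disease) pair is mentioned in the same sentence."""
    counts = Counter()
    for sentence in sentences:
        genes = find_entities(sentence, GENES)
        diseases = find_entities(sentence, DISEASES)
        for pair in product(sorted(genes), sorted(diseases)):
            counts[pair] += 1
    return counts


if __name__ == "__main__":
    for (gene, disease), n in cooccurrences(SENTENCES).most_common():
        print(f"{gene} <-> {disease}: {n} sentence(s)")
```

Co-occurrence only suggests that a relationship may exist. It does not capture the type or direction of the relationship, and it ignores negation, so candidate pairs found this way would still need expert review.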
Most drug projects are designed to cure diseases that affect large populations, but the rare disease patient community is empowered and is pushing for answers, and initiatives like the Orphan Drug Act offer incentives to address these rare diseases. Understanding these less complicated but rare diseases, which often arise from a single gene mutation, can shed light on more complex diseases, and text mining has great utility in this use case [5].

B. Mining for Preclinical and Clinical Drug Safety

The ultimate goal of preclinical testing is to accurately model the drug’s safety in animals to predict what will happen in humans. The risk of adverse events can vary across therapeutic areas, and some drug classes inherently carry liabilities for certain adverse events [6]. The tolerance for side effects can also vary across therapeutic areas. Because only a fraction of drugs succeed, there is a large amount of data from failed compounds in unstructured internal reports and study documents as well as in the published literature. In a small number of cases, unsafe medicines have progressed into human trials and beyond because a preclinical safety “signal” was lacking, and much is being done to prevent this. In the 1990s, a number of drugs were found to cause a life-threatening cardiac arrhythmia associated with QT prolongation and were withdrawn from the market [7]. This has led to testing of all drugs for this liability. One example of how text mining can help is in building a reference compound set for the evaluation of QT prolongation. In 2015, the HESI Pro-Arrhythmia Working Group published work using text mining to identify both human and non-rodent animal studies that assessed QT signal concordance between species and identified drugs that prolonged the QT interval [8]. In this work, text mining was essential for identifying compound-to-biological-effect-to-species relationships in the published literature for expert review.

C. The Role of Text Mining in Drug Submission

The submission of a new drug to a regulatory agency such as the FDA requires proof that the medicine is safe and effective, as demonstrated by non-clinical testing and clinical trials. The submission package can contain thousands of pages of written material. Text mining can support this process in a number of ways, saving time for project teams and potential reviewers. A real-world example of how text mining can affect the submission process is discussed here; although the example has been stripped of proprietary details, it should still demonstrate tangible value. In this case, the team was filing for a waiver for additional safety studies. Based on their knowledge of the drug’s pharmacology and that of other drugs in the same class, the team felt the waiver was warranted but still needed to convince the regulatory agency. A keyword-based literature search found 3,000 full-text documents that needed further review to identify the smaller set of documents relevant to the specific question. The completed review and summary report were due in 7 months, and an outside vendor quoted 9 months and $180,000 to complete the review. Text mining of the full-text documents and the subsequent review were instead completed in 2 months, 5 months ahead of the deadline and without the $180,000 vendor cost. While the monetary impact of text mining and other informatics processes in R&D can be hard to quantify, this example demonstrates the clear value of text mining.

D. Post-launch, Mining Social Media for Patient Insights

Social media is a largely untapped source of information and insights for pharma on therapy efficacy and safety, patient journey, unmet need, and customer reputation. When patients receive a life-altering disease diagnosis and subsequent treatment, many will turn to social media for support and additional information.
As an industry, pharma has been using social media to communicate with patients via channels like Twitter, but this has largely been driven by commercial functions seeking to inform the public. While pharmaceutical companies are using social media to provide product information and promotional materials, they should also be using it to better understand patients' needs and experiences and to provide additional education, particularly to those with chronic illnesses. One emerging trend is to use text mining to analyze sentiment, identify adverse events and glean insights from social media. Social media conversations can also inform R&D and pharmacovigilance efforts [10]. Social media is here to stay, and pharma should engage in it responsibly to get in touch with patients. When companies engage and have the right tools and analytics capabilities, including technologies like text mining, they can gain valuable insight into what patients are saying and use those insights to make better treatments that improve patients' quality of life.
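As a rough illustration of mining posts for sentiment and adverse events, the sketch below uses small keyword lexicons. This is a deliberately simple assumption made for the example and not the approach of any particular tool. The lexicons, the placeholder product name DrugX, and the example posts are all hypothetical.

```python
import re

# Hypothetical lexicons for illustration only. A real pipeline would map
# patient language to a controlled medical vocabulary instead.
ADVERSE_EVENT_TERMS = {"nausea", "headache", "rash", "dizziness"}
POSITIVE_TERMS = {"better", "improved", "relief", "great"}
NEGATIVE_TERMS = {"worse", "terrible", "awful", "stopped"}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def analyze_post(text):
    """Return the adverse-event terms mentioned and a coarse sentiment label."""
    tokens = tokenize(text)
    events = sorted(set(tokens) & ADVERSE_EVENT_TERMS)
    score = sum(t in POSITIVE_TERMS for t in tokens) - sum(
        t in NEGATIVE_TERMS for t in tokens
    )
    if score > 0:
        sentiment = "positive"
    elif score < 0:
        sentiment = "negative"
    else:
        sentiment = "neutral"
    return {"adverse_events": events, "sentiment": sentiment}


# Made-up posts about a placeholder product, written for this sketch.
POSTS = [
    "Two weeks on DrugX and my symptoms are so much better, huge relief.",
    "DrugX gave me terrible nausea and a headache, so I stopped taking it.",
    "Starting DrugX tomorrow, wish me luck.",
]

if __name__ == "__main__":
    for post in POSTS:
        print(analyze_post(post), "|", post)
```

Keyword matching of this kind misses negation ("no nausea at all"), sarcasm, misspellings and colloquial descriptions of symptoms, so flagged posts should be treated as candidates for human review and not as confirmed adverse event reports.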

Publication date: 2016